Using Genetic Programming to Evaluate the Impact of Social Network Analysis in Author Name Disambiguation
نویسندگان
چکیده
In digital libraries, which have become extremely popular in the scientific community, often people want to find publications by an author using the author name as a query. However, since authors may have many denominations and one denomination may refer to many authors, name searches may present ambiguous results. To tackle this problem, several studies have been developed. Recently the use of social networks has been studied in author name disambiguation. In this article, we use a machine learning approach based on Genetic Programming to evaluate the impact of social network analysis in author name disambiguation. Through experiments using real-world data, we show that social network analysis greatly improves the quality of results. Also, we demonstrate that our approach is able to compete with state-of-the-art techniques.
منابع مشابه
Evaluating the Use of Social Networks in Author Name Disambiguation in Digital Libraries
Digital libraries have become an important source of information for scientific communities. However, by gathering data from different sources, the problem of duplicate and ambiguous information about author names arises. Traditional methods of name disambiguation use syntactic attribute information. However, recently the use of relationship networks has been studied in data deduplication. This...
متن کاملبهبود صحت ابهامزدایی نام نویسنده با استفاده از خوشهبندی تجمّعی
Today, digital libraries are important academic resources including millions of citations and bibliographic essential information such as titles, author's names and location of publications. From the view of knowledge accumulation management, the ability to search fast, accurate, desired contents, has a great importance. The complexity and similarity in these resources cause many challenges and...
متن کاملInvestigating Association between Social influence, Productivity, and Performance in Co-author Network of Researchers in Medical Ethics
The purpose of this research is to investigate association between social influence, productivity, and performance among researchers of medical ethics field. This research was done using common methods in scientometric studies with the method of co-author and network analysis. The statistical population of the study consists of all articles published in journals in the field of medical ethics,...
متن کاملSustainability in paper industry closed-loop supply chain (case study: East Azerbaijan province, Iran)
Governments and customers are forcing the paper manufacturers to become more sustainable. Accordingly, there still exists a gap in the quantitative modeling of these issues. In this paper, this gap is covered through simultaneously considering economical, environmental and social impacts in the paper closed-loop supply chain network design. The proposed multi-objective, multi-echelon, multi-pro...
متن کاملAn Application of Genetic Network Programming Model for Pricing of Basket Default Swaps (BDS)
The credit derivatives market has experienced remarkable growth over the past decade. As such, there is a growing interest in tools for pricing of the most prominent credit derivative, the credit default swap (CDS). In this paper, we propose a heuristic algorithm for pricing of basket default swaps (BDS). For this purpose, genetic network programming (GNP), which is one of the recent evolutiona...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010